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Abstract Evidential grids have been recently used for mobile object perception. 
The novelty of this article is to propose a perception scheme using prior map knowl- 
edge. A geographic map is considered an additional source of information fused 
with a grid representing sensor data. Yager's rule is adapted to exploit the Dempster- 
Shafer conflict information at large. In order to distinguish stationary and mobile ob- 
jects, a counter is introduced and used as a factor for mass function specialisation. 
Contextual discounting is used, since we assume that different pieces of information 
become obsolete at different rates. Tests on real-world data are also presented. 



1 Introduction 

Autonomous driving has been an important challenge in recent years. Navigation 
and precise localisation aside, environment perception is an important on-board sys- 
tem of a self-driven vehicle. The level of difficulty in autonomous driving increases 
in urban environments, where a good scene understanding makes the perception 
subsystem crucial. There are several reasons that make cities a demanding environ- 
ment. Poor satellite visibility deteriorates the precision of GPS positioning. Vehicle 
trajectories are hard to predict due to high variation in speed and direction. Also, the 
sheer number of mobile objects poses a problem, e.g. for tracking algorithms. 

On the other hand, more and more detailed and precise geographic databases be- 
come available. This source of information has not been well examined yet, hence 
our approach of incorporating prior knowledge from digital maps in order to im- 
prove perception scheme. A substantial amount of research has focused on the map- 
ping problem for autonomous vehicles, e.g. Simultaneous Localisation and Mapping 
(SLAM) approach, but the use of maps for perception is still understudied. 
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In this article, we propose a data fusion method based on Dempster-Shafer the- 
ory [8] taking into account meta-knowledge obtained from a digital map. We show 
the advantage of including prior knowledge into an embedded perception system of 
an autonomous car. The vehicle environment is modelled by 2D occupancy grids 
proposed in [2]. This paper describes a robust and unified approach to a variety of 
problems in spatial representation using the theory of probability. The theory of ev- 
idence was not combined with occupancy grids until recently to build environment 
maps for robot perception [7]. Only recent works take advantage of the theory of ev- 
idence in the context of mobile perception [4]. Some works use 3D city model as a 
source of prior knowledge for localisation and vision-based perception [1], whereas 
our method uses maps for scene understanding. 

This article is organised as follows. In section 2, we describe the details of the 
method. Section 3 presents the results and section 4 concludes the paper. 



2 Multi-grid fusion approach 

This section presents the proposed perception schemes. The grid construction 
method is described in section 2.2 and all data processing steps are detailed in sec- 
tion 2.4. Figure 1 presents a general overview of our approach. 
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Fig. 1 Method overview (lidar: laser scanner, Applanix: inertial measurement unit). 



2.1 Heterogeneous data sources 

There are three sources in our perception system: vehicle pose, lidar range scanner 
point cloud and vector maps. The vehicle pose comes from the Applanix system 
based on a GPS, an odometer and an IMU. The system is supposed to provide pre- 
cise and integral positioning. Our main source of information about the environment 
is an IBEO Alaska XT lidar able to provide a cloud of about 800 points 10 times per 
second. The digital maps that we use were provided by the French National Geo- 
graphic Institute (IGN) and contain 3D building models as well as the road surface. 
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We also performed successful tests with freely available OpenStreetMap project 2D 
maps [6], but here we limited the use to building data. We assume the maps to be 
precise and accurate. 



2.2 Occupancy grids 

An occupancy grid models the world using a tessellated representation of spatial 
information. In general, it is a multidimensional spatial lattice with cells storing 
some stochastic information. In our case, each cell representing a box (a part of 
environment) X xY where X = [x_ , x+], Y = [y_ , y+] stores a mass function. 

• ScanGrid (SG) construction: In order to process the hdar data, an eviden- 
tial occupancy grid is computed when a new scan arrives, this grid is called 
ScanGrid. Each cell of this grid stores a mass function on the frame of dis- 
cernment (FOD) QsG = {F, O}, where F refers to the free space and O - to the 
occupied space. The basic belief assignment, which reflects the sensor model, is 
described in [4]. 

• MapGrid (MG): To store the results of information fusion, an occupancy grid 
MG has been introduced with a FOD Qmg = {F, C, N, S, V}. Respective classes 
represent: free space F, mapped infrastructure (buildings) C, non-mapped infras- 
tructure A^, temporarily stopped objects S and mobile (moving) V objects. Qmg 
is a common frame used for information fusion. By using MG as a cumulative 
information storage, we are not obliged to aggregate preceding ScanGrids. 

• PriorGrid (PG) context representation: PG allows us to perform a contextual 
information fusion incorporating some meta-knowledge about the environment. 
This grid uses the same frame of discemment Qmg as MG. The grid is obtained 
by projection of map data, buildings and roads, onto a 2D grid with global coor- 
dinates. 

We define two sets of polygons defining the 2D position of buildings and 
road surface by, respectively, ^ = \ bi= ^^^^ ' ' ,i G [O,^^] \ and ^ = 

I lyiyi-'-ymij ) 

= -^i-^s ■■■Xmi g [0 I ^ = 0. Then, we attribute the mass to each 
[yiy2---ymi\ J 

cell of the PriorGrid in the following way: 

We note that B = {C}, R = {F, S, V}, T ^ {F, N, 5, V} for convenience and 
readability only. A denotes aU other strict subsets of Q. These abases charac- 
terise the meta-information inferred from geographic maps. For instance, on the 
road surface R, we encourage the existence of free space F as well as stopped S 
and moving V objects. Analogically, building information B fosters mass trans- 
fer to C. Lastly, T denotes the intermediate area, e.g. pavements, where mobile 
and stationary objects as well as small urban infrastructure can be present. Note 
that neither buildings nor roads are present, so we exclude existence of mapped 
infrastructure C, but we cannot omit other classes. Also, we define a level of con- 
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fidence J3 for each map source, possibly different for each context. Letf = 



mpG{X,Y}{B) 
mpG{X,Y}{R) 
mpG{X,Y}{T) 

mpG{X,Y}iQ)=< 
mpG{X,Y}{A)=0 



pB if{x,y)ebi 
otherwise 

Pr if (xj) e n 
otherwise 

if {x,y) e biV {x,y) e rj 
jSr otherwise 

'1-J3b if {x,y)ebi 
1-I5r if (x J) en- 

1 — J3r otherwise 



Vie [0,nB] 

yiG[0,nB],yje[0,nR] 

yiG[0,nB]yje[0,nR] 
VACQ and A ^{B,R,T} 



(1) 



2.3 Incorporating prior knowledge 

The frame of discernment QsG used in SG is distinct from Qmg, so in order to 
enable the fusion of SG and MG we define a refining rsG ■ 2^sg 2^mg such 
that rsG{{F}) = {F}, rsciiO}) = {C,N,S,V}, rsciA) = {JeeArscie). The re- 
fined mass function can be expressed as m^Q^ {rsG (A)) = mfo^ (A) , VA C QsG- 
Then, Dempster's rule is applied in order to exploit the prior information included 
in PriorGrid: 



m 



SG,t 



^SG. 



^MG 

PG 



(2) 



2.4 Temporal fusion 



Computing conflict masses 



We use the idea from [5] to distinguish between two types of conflict, which arise 
from the fact that the envirormient is dynamic. We denote Htpo the conflict induced 
when a free cell in MG is fused with an occupied cell in SG. Similarly, doF in- 
dicates the conflicted caused by an occupied cell in MG fused with a free cell in 
SG. In an error-free case, these conflicts represent, respectively, the disappearance 
and the appearance of an object. Conflict masses are calculated using the formu- 
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las: mMG,t {I^of) = mMG,t-i {0)-msG,t {F), mMG,t (0fo) = mMG,t-i {F)-msG,t (O), 
where m{0) = i;m(A), VA C {C,N,S,V}. 

A 



MapGrld specialisation using a counter 

Mobile object detection is an important issue in dynamic environments. We propose 
the introduction of a counter ^ in each cell in order to include temporal information 
on the cell occupancy. For this purpose, incrementation and decrementation steps 
Sine G [0, 1], Sdec S [0, 1], as Well as threshold values jo, 7@ have been defined. 

= min (^1, ^('"1) + 5i„c) if mMG{0) > Jo and otmg (0fo) +otmg (0of) < 70 
C^^) = max {q, ^('"1) - 5de^ if mmg i®Fo) +mMG (^of) > 70 

Otherwise ^(/) rests unchanged. Using ^ values, we impose a speciahsation of 
mass functions in MG using the equation: 

m'MG,, (A) = S{A,B)-mMGAB) 0) 
where speciahsation matrix •) is defined as: 



S{A\{V},A)^C \/A CQmg and {V}e A 

S{A,A) = l-^ VACi^Mcand {y}eA 

S{A, A) = 1 VA C Qmg and {y } ^ a 
S{-,-) = otherwise (4) 



Fusion rule 

An important part of the method consists in fusing a discounted and specialized MG 
(see section 2.5 and preceding paragraph) with a SG combined with prior knowledge 
(see section 2.3). 

mMG,t =" m'MG,t-i ® m'sGj (5) 

The fusion rule ® is a modified Yager's rule [10] adapted to mobile object de- 
tection. There are of course many different rules that could be used, but in order 
to distinguish between moving and stationary objects some modifications had to be 
included. These modifications consist in transferring the mass corresponding to a 
newly appeared object dpo to the class of moving objects V as described by the 
equation 6. Symbol (n) denotes the conjunctive fusion rule. 
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(otI ®OT2) (A) = {mi @W2) (A) 

(wi ® mi) (y) = (mi ©mi) (V) + (mi ©mi) (0/ro) 
(mi®m2)(i2) = (mi©m2)(i2) + (mi©m2)(0oF) 

(mi®m2)(0fo) =0 
(mi ®m2) (0of) = 



yACQAAj^V 



(6) 



All the above steps allow us to construct a MapGrid containing reach informa- 
tion on the environment state, including the knowledge on mobile and static objects. 



2.5 Contextual discounting 

Information discounting allows to forget information which is no longer valid. Dis- 
counting parameter a serves to model the speed with which information becomes 
obsolete. Thanks to the contextual discounting [3], we make use of more detailed 
information regarding the confidence we have in the source in various contexts. We 
noticed that different pieces of information become obsolete with different speed. 
Hence, the coarsening used is = 0f/jnamio with e^taHc = {C,N}, 

^dynamic = {S,V}, Ofree = {F}, and disCOUnt rates a = [astatic, adynamic, afree}-"^^ 

assign higher discount rates (lower confidence) to rapidly changing contexts such as 
free space, stopped and moving objects, and lower rates to the static context. The 
discounted mass function is obtained by the disjunctive combination of the input 
mass function mmg and mass functions for each element of the partition 0. 



where each mass function m/ (/ = static, dynamic, free) is defined by m; (0/) = a;, 
mi (0) = 1 - a/, m/(A) = 0, VA C X2 AA ^ {0, 0,}. 



3 Results 
3.1 Setup 

The data set used for our experiments was acquired in cooperation with IGN in 
Paris. The overall length of the trajectory was about 3 km. The size of the grid cell 
in the occupancy grids was set to 0.5 m, which is sufficient to model a complex 
environment with mobile objects. The discount rates a describing the speed of in- 
formation becoming obsolete were defined empirically, but they can be learnt from 
data, as proposed in [3]. We have defined the map confidence factor j3 by ourselves, 
but ideally, it should be given by the map provider. j3 describes data currentness 



a 



mMG,t =mMG,t@ ^static 

@mdynamic@mf 



(7) 



ree 
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Fig. 2 (a) Scene, (b) PG. (c) MG without prior information, (d) MG with prior map knowledge. 

(age), errors introduced by geometry simplification and spatial discretisation. /3 can 
also be used to depict the localisation accuracy. Other parameters, such as counter 
steps 8i„c, ddec and thresholds yo, 70 used for mobile object detection determine the 
sensitiveness of mobile object detection and were set by manual tuning. 



3.2 Impact of prior knowledge 

The results for a particular instant of the approach tested on real-world data are 
presented on figure 2. The visualisation of the MG has been obtained by calculating 
the pignistic probability of each class [9]. The presented scene contains two cars 
(only one is visible in the camera image) going in the direction opposite to the test 
vehicle and a bus parked on the road edge. Bus and car positions are marked on the 
grids by green and red boxes, respectively. The test vehicle position is shown as a 
blue box. Different classes of ^mg are represented by different colours: F - white, 
C,N - blue, S - green and V - red. PG on figure 2(b) shows the position of the road 
space (white) and buildings (blue). 

The principal advantage gained by using map knowledge is richer information 
on the detected objects. A clear difference between a moving object (red, car) and 
a stopped one (green, bus) is visible. Also, stopped objects are distinct from in- 
frastructure when prior map information is available (cf. figures 2(c) and 2(d). In 
addition, thanks to the prior knowledge, stationary objects (cyan) such as infras- 
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tructure are distinguished from stopped objects on the road. Grids make noticeable 
the effect of discounting, as information on the environment behind the vehicle is 
being forgotten. On the other hand, the parked bus is still in evidence despite being 
occluded by the passing car. 



4 Conclusion and perspectives 

A new mobile perception scheme based on prior map knowledge has been intro- 
duced. Geographic information is exploited to reduce the number of possible hy- 
potheses delivered by an exteroceptive source. A modified fusion rule taking into 
account the existence of mobile objects has been defined. Furthermore, the vari- 
ation in information lifetime has been modelled by the introduction of contextual 
discounting. In the future, we anticipate removing the hypothesis that the map is 
accurate. This approach will entail considerable work on creating appropriate er- 
ror models for the data source. Moreover, we envision differentiating the free space 
class into two complementary classes to distinguish navigable and non-navigable 
space. This will be a step towards the use of our approach in autonomous naviga- 
tion. Another perspective is the use of reference data to validate the results, choose 
the most appropriate fusion rule and learn algorithm parameters. We envision using 
map information to predict object movements. It rests also a future work to exploit 
fully the 3D map information. 
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